o 

't. : 



s : : 



yj 



Attorney Docket: M-9564 I ^ ^ 

697678 v2 ^ EL699365663US 



Buffer Management For Communication Systems 

Inventors: 

Onkar Sangha, Ed Kwan, Vijay Maheshwari 

& 

Akjhiro Kiuchi 



BACKGROUND 



10 Field Of Invention 



The present invention relates to data communications and, more particularly, 
to a method and system for managing data buffers in a communication system to 
promote communication among multiple networks. 

Copyright And Trademark Notice 

15 A portion of the disclosure of this patent document may contain material that 

is subject to copyright protection. The owner has no objection to the facsimile 
reproduction by any one of the patent document or the patent disclosure, as it 
appears in the Patent and Trademark Office patent file or records, but otherwise 
reserves all copyrights whatsoever. 

20 Certain marks referenced herein may be common law or registered 

trademarks of third parties affiliated or unaffiliated with the applicant or the 
assignee. Use of these marks is for the purpose 6f providing an enabling disclosure 
by way of example and shall not be construed to limit the scope of this invention to 
items or services associated with such marks. 

25 Related Art 

With the advancement of digital and communication technology, large 
volumes of information are transferred to various destinations in communication 
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networks. Communication networks are established and designed to promote the 
transfer and delivery of information to these various destinations in an efficient and 
reliable manner. Various communication networks transfer information based on 
different communication protocols. As such, systems and methods are developed to 
5 handle communication packets transmitted between multiple networks, such that 
information included in the packets assembled using one communication protocol 
can be recognized by the destination device or network that may communicate using 
a different communication protocol. 

Switching communication packets received from a source and routing these 
y| 10 packets to the destination requires the packets to be temporarily stored in a data 

storage management system. A buffer management system is typically used at the 
O processing stage. A buffer manager, typically, manages communication packets by 

jTj storing limited descriptor data associated with the packets in a fast data storage 

%i medium such as an on-chip memory, and storing the rest of the data associated with 

Q 15 the packets in slower external memory. On-chip memory is storage medium 

=== 

l=j integrated on data processing circuitry, while external memory is not, therefore data 

stored in on-chip memory can be accessed faster by the data processing unit on the 
U= data processing circuitry. 

Due to high costs and overhead associated with manufacturing small size on- 
20 chip memory, the on-chip memory implemented in a buffer management system 
needs to be limited to a reasonable size. However, processing speed is dependent 
upon successfully storing and quickly accessing descriptor data from on-chip 
memory. When managing high traffic communications, communication packets 
may be dropped, due to the on-chip memory reaching a maximum capacity. As 
25 such, this size limitation may result in data loss and can substantially reduce the 
throughput of the buffer management system. 

Thus, a system and method are needed that can address the size limitations 
associated with storage of descriptor data in on-chip memory. 
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In accordance with this invention, methods and systems for managing data 
packets in various communication networks are provided. Data packets assembled 
in one communication network can be destined for nodes in one or more other 
communication networks. During transmission, data packets may have to be 
segmented and reassembled before they reach their destination depending on 
communication protocols used at the sending and receiving network nodes. 
Communication systems (e.g., switches, routers, and gateways) are used to segment 
and/or reassemble communication packets in accordance with communication 
protocols of the various communication networks. 

To reassemble or segment the packets, a communication system processes 
the data as it arrives. However, in many instances, the communication system may 
not be fast enough to process packets as quickly as they arrive. Therefore, the 
packets or information included in them are temporarily stored in a data storage 
medium until they are processed. In embodiments of the system, one or more data 
storage units, referred to as data buffers, are utilized to store communication packets 
or data associated with such packets. Data buffers are data storage locations 
allocated in memory. 

Embodiments of the system are directed to managing data buffers such that 
communication packets stored in the buffers are efficiently processed, segmented, 
reassembled, and forwarded to appropriate network destinations. In one or more 
embodiments, the system includes one or more queues that are used to reference and 
store information associated with the communication packets. Each queue may be 
implemented either as on-chip memory or in external memory. In certain 
embodiments, the queues are implemented as a combination of the two. Due to the 
overhead associated with managing data transfer to and from external memory, 
storing and accessing data in on-chip memory is faster than storing or accessing data 
in external memory. Unfortunately, because on-chip memory is expensive to 
manufacture in comparison with external memory, to save costs, it is desirable to 
limit the size of on-chip memory to a minimum. 
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In a high traffic communication system, however, this size limitation may 
adversely effect the speed at which communication packets can be processed. As 
described in further detail below, one queue may be used for storing pointers (i.e., 
memory addresses) to a data buffer that includes a received communication packet, 
5 while another queue may be used for storing pointers to a data buffer that includes a 
communication packet ready to be transmitted to its destination. If a queue 
implemented in on-chip memory and used for storing received communication 
packets is full, then the system will not be able to receive any more packets until the 
information currently stored in the on-chip queue are processed. This inability to 

_ 10 support newly arriving communication packets can lead to communication packets 

J3 being dropped, and slow down the communication speed. 

H In accordance with one aspect of the system, to avoid transmission 

inefficiencies and data loss that may occur due to size limitations associated with on- 
SJ chip queues, the content of the queues is transferred to external memory when the 

g 15 queues reach a maximum threshold level. In some embodiments, the content 
fi; transferred to the external memory is then transferred back to the same on-chip 

S I | 

y| queue or other on-chip queues included in the system when needed. The on-chip 

£7 queues included in the system are: Read Free Queue (RFQ), Write Free Queue 

(WFQ), Receive Ready Queue (RRQ), and Transfer Ready Queue (TRQ). 

20 RFQ is implemented to include free data pointers. Free data pointers are 

memory addresses that point to fixed-size empty data buffers allocated in external 
memory. When a packet is received, the I/O core requests a free data pointer from 
RFQ and stores the packet in the data buffer associated with that data pointer. The 
I/O core then constructs a buffer descriptor (BD) that contains header information 

25 associated with the communication packet but not the packet's payload. The 

constructed BD is truncated to minimize storage requirements and is stored in RRQ. 

The CPU receives an interrupt to read RRQ when RRQ is not empty. CPU 
reads a BD and based on information stored therein determines whether a 
communication packet associated with the BD is to be transmitted to a different I/O 
30 port. Handling data packets through the associated BDs speeds up packet processing 
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in that the CPU avoids the resource consuming process of accessing the external 
memory to retrieve needed data. All processing related data is included in BDs 
stored in RRQ implemented in on-chip memory. 

If a communication packet is to be transferred to a different I/O port, the 
5 associated BD is transferred from RRQ to a TRQ associated with the destination 
port. In embodiments of the system, each I/O port is associated with a specific TRQ 
and WFQ. TRQ is implemented to store BDs that are to be transmitted via an I/O 
port. WFQ is implemented to store freed data pointers pointing to data buffers in 
_ external memory after the communication packets stored therein are processed and 

y3 10 transmitted. Free data pointers stored in WFQ are returned to RFQ in due course as 

^1 described below. 

G 

In embodiments of the system, data buffer queues have configurable 

I = ! 

■srsr 

minimum and maximum thresholds. These thresholds are used to monitor and 
JL maintain an optimal storage level in the queue. For example, when a maximum 

N; 15 threshold is reached, data pointers or BDs stored in the queues are transferred to 
Ln separate queues in external memory. When a minimum threshold is reached, data 

y pointers or BDs stored in external memory, if any, are transferred to corresponding 

queues. For example, data pointers transferred to external memory from a WFQ 

associated with one I/O are returned to that I/O's RFQ. The expansion of data 
20 pointers and BDs to external memory allows the buffer manager to handle high 

traffic communication without substantially compromising processing speed or 

throughput. 

In certain embodiments of the system, TRQs associated with different I/O 
ports are read based on one or more priority schemes to comply with quality of 

25 service requirements of I/O streams received on various ports. Other embodiments 
of the system are implemented to handle communication packets received via a 
single I/O port that includes interleaved streams. Interleaving is a communication 
scheme wherein a single communication stream includes packets of data that are 
generated from more than one source or packets of data that are destined for more 

30 than one destination. 
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An example of a communication protocol supporting such streams is the 
ATM protocol. Embodiments of the system support 32 ATM channel streams. The 
I/O core detects the channel associated with each received packet and constructs a 
BD that includes channel information. Since packets for different channels are 
5 interleaved in the same stream, the last packet associated with each channel includes 
a field identifying it as the last packet. Thus, a BD also includes information 
indicating whether a packet is the last packet received on a certain channel. The BD 
is stored in RRQ by the I/O core and the CPU is interrupted to read and process the 
BD. 

10 One or more embodiments of the system include separate and distinct BD 

queues implemented in external memory that correspond to different ATM channels, 
for example. The CPU reads a BD from RRQ and during processing the BD 
examines the channel information included therein. The BD is then stored in the 
queue in external memory corresponding to the channel identified by the BD. The 

1 5 CPU also examines the BDs to determine whether the packet associated with the BD 
is the last packet in the communication stream received on a certain channel. When 
the CPU detects the last packet it notifies the driver associated with the application 
or port to which the ATM stream is directed. Thus, the data packets are delivered to 
the appropriate destination. 

20 The invention is not limited to any particular embodiments described above. 

Rather, the embodiments provided herein are intended to be illustrative, but not 
limiting, of the scope of the invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
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FIG. 1 illustrates a communication system wherein voice and data are 
transmitted among multiple networks through a communication system, in 
accordance with one or more embodiments of the invention. 

FIG. 2 illustrates a block diagram of the components of the communication 
system of FIG. 1 , in accordance with one or more embodiments of the invention. 

FIG. 3 A illustrates a data pointer 310 associated with a data buffer in 
external memory, in accordance with one or more embodiments of the invention. 

FIG. 3B illustrates a buffer descriptor associated with a communications 
packet, in accordance with one or more embodiments of the invention. 

FIG. 4 illustrates an example of different configurations for the transmit 
ready queue implemented in accordance with one aspect of the invention. 

FIG. 5 illustrates a block diagram of dual port memories that can be used in 
implementation of on-chip memory queues of the system of FIG. 1, in accordance 
with one or more embodiments of the system. 

FIG. 6 illustrates a flow diagram of a method for receiving communication 
packets from one or more I/O ports, according to one or more embodiments of the 
system. 

FIG. 7 illustrates a flow diagram of a method for processing received 
communication packets from an I/O port, in accordance with one aspect of the 
system. 

FIGS. 8B and 8D through 8F illustrate flow diagrams of methods of 
transferring information between external memory and on-chip memory queues, in 
accordance with one or more embodiments of the invention. 
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FIG. 8C illustrates a flow diagram of a method of monitoring the integrity of 
received data, in accordance with one or more embodiments of the invention. 

FIG. 9 illustrates a method for transmitting a communication packet to a 
designated destination, in accordance with one or more aspects of the system. 

FIG. 10 is a block diagram illustrating separate expansion buffers allocated 
in external memory for each I/O port where data pointers are temporarily stored 
before the data pointers are transferred to the on-chip memory queue from which the 
data pointers were originated. 

FIG. 1 1 illustrates a block diagram of a buffer management queue having a 
write FIFO and read FIFO, in accordance with one or more embodiments of the 
system. 

FIG. 12 illustrates a block diagram of an on-chip queue, in accordance with 
one or more embodiments of the invention, for storing buffer descriptors associated 
with communication packets received as a part of an interleaved communication 
stream. 

FIG. 13 illustrates a flow diagram of a method of sorting buffer descriptors 
into respective on-chip memory queues in external memory during processing and 
transferring communication packets to higher communication layers. 

FIG. 14 illustrates a block diagram of the buffer manager of the system and 
the interface signals in accordance with one embodiment of the system. 
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DETAILED DESCRIPTION 

In the following detailed documentation, numerous specific details are set 
forth with respect to one or more embodiments of the invention. Of course, the 
invention may be practiced without some or all of these specific details or with 
5 variations in those details. In some instances, certain well known features have not 
been described in detail so as not to obscure other aspects of the invention. 

System Architecture 

q FIG. 1 illustrates a communication system wherein voice and data are 

transmitted among multiple networks through a communication system or Integrated 
SI 10 Access Device (IAD) 110. IAD 110 multiplexes a variety of communication 
y5 technologies onto one or more communication lines for transmission to a carrier 

(e.g., the local telephone company). As shown, in accordance with one or more 
s embodiments of the system, IAD 110 promotes communication of voice and data 

£1 between Local Area Network (LAN) 120, telephones 1 and 2, and DSL Access 

15 Multiplexor (DSLAM) 130. DSLAM 130 is a central office device that intermixes 

□ voice and Digital Subscriber Line (DSL) traffic in a Wide Area Network (WAN). 

p= 

DSL is a communication technology that dramatically increases the digital 
capacity of ordinary telephone lines and allows for faster data communication. DSL 
technology is directed to two types of usage. Asymmetric DSL (ADSL) is for 

20 Internet access, where fast data download is required, but slow data upload is 
acceptable. Symmetric DSL (SDSL, HDSL, etc.) is designed for short haul 
connections that require high speed in both directions. In embodiments of the 
invention, communication between DSLAM 130 and IAD 1 10 is accomplished via 
the above-referenced communication methods or other well know communication 

25 methods, such as Frame Relay and Asynchronous Transfer Mode (ATM), for 
example. 

Frame Relay is a high-speed packet switching protocol used in WANs. 
Frame relay provides permanent and switched logical connections, known as 
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Permanent Virtual Circuits (PVCs) and Switched Virtual Circuits (SVCs). PVCs are 
logical connections provisioned ahead of time, while SVCs are provided on demand. 
The connections are identified by a Data Link Connection Identifier (DLCI) number. 
Every DLCI requires a Committed Information Rate (CIR) that defines a certain 
5 amount of transmission capacity for the connection. In contrast, ATM is a network 
protocol for both LANs and WANs that supports real-time voice and video as well 
as data. ATM uses switches that establish an end to end logical circuit. ATM 
guarantees a quality of service (QoS) for a transmission because it transmits all 
traffic in fixed-length packets that are easier to process in comparison with variable- 
O 10 length packets. 

C\ As illustrated in FIG. 1, various communication protocols (i.e., standards) 

C3 can govern data communication in various segments of a network. IAD 110 

Id includes one or more I/O ports to communicate with the various network segments. 

J 3 Information is grouped into communication packets in various formats based on the 

P 15 communication protocol used in each network segment. In embodiments of the 

ry system, IAD 1 1 0 is implemented so that it can be programmed to process 

%} communication packets received from various network segments and forward them 

M= to the designated destination in the proper format. 

Processing a communication packet sometimes involves segmenting the 
20 packet into smaller portions and reassembling the smaller portion into newly formed 
packets, such that the new packet formats are recognizable by the destination device 
or network segment. This transition requires storing information contained in the 
packet and routing the information to an I/O port associated with the destination 
network segment. IAD 110 includes buffer manager 100 that receives, segments, 
25 and reassembles communication packets. Buffer manager 1 1 0 further prioritizes the 
storage and transfer of information contained in the packets in an efficient manner 
between various I/O ports. In certain embodiments of the system, each I/O port 
works in conjunction with a separate buffer manager. 

As illustrated in FIG. 2, LAD 110 includes memory bus (MBUS) 210, 
30 control bus (CBUS) 220, buffer manager 100, external memory 240, MBUS 
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interface 250, Direct Memory Access (DMA) engine 280, and I/O core 290. I/O 
core 290 receives and transmits communication packets from receive and transfer 
lines 291 and 292, respectively. 

Buffer manager 100 includes First In First Out (FIFO) queues 230, CBUS 
5 interface and registers 260, and Input/Output (I/O) interface 270. I/O core 290 
communicates with buffer manager 100 via I/O interface 270. In certain 
embodiments of the system, the DMA engine 280 interfaces with buffer manager 
100 and initiates data transfers between FIFO queues 230 and external memory 240 
via MBUS 210. A central processing unit (CPU), not shown, communicates with 
1 0 FIFO 230 and DMA engine 280 via CBUS interface and registers 260 through 

CBUS 220. Embodiments of the invention may include more than one CPU. The 
CPU may be a RISC (Reduced Instruction Set Computer) based processor or other 
processor suitable for performing data communications related tasks. 

The CPU examines a communication packet and makes the appropriate 
15 changes to memory tables and data structures to allow forwarding of packets in the 
appropriate format and to the designated destinations. In embodiments of the 
system, a communication packet is stored in a data structure called a data buffer. A 
data buffer is, typically, a pre-allocated range of addresses in external memory (e.g., 
dynamic RAM memory) reserved for storage of data. In some embodiments, data 
20 buffers are fixed in size and are set during configuration. For example, a 64 

Kilobyte memory may be divided (i.e., partitioned) into 64 1024-byte data buffers or 
alternatively, 128 512-byte data buffers, depending on the type of communication 
packet being processed. A communication packet is stored in one or more data 
buffers, depending on the size of the packet and the data buffer. That is, if the size 
25 of a communication packet is greater than the buffer size, then more than one data 
buffer is consumed to store the packet. 

In one or more embodiments, the system transfers communication packets 
among one or more I/O ports that communicate with different network segments. 
Different type of communication packets differ in length. Therefore, depending on 
30 the communication protocol used, in certain embodiments of the system, the size of 
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the data buffers is programmable based on the type of communication protocol used. 
For example, in case of ATM I/O and Ethernet I/O, data buffers may be 
programmed to hold 512 bytes, or 2048 bytes respectively. 

As communication packets are processed by the CPU, the packets may need 
5 to be copied or moved from one memory location (e.g., data buffer) to another. 
Moving data from one memory location to another involves deleting the original 
data and copying it to another memory location. Therefore, the speed at which 
communication packets are processed depends greatly on the CPU's processing 
speed and the speed at which data can be fetched from the data storage medium. 
10 Directly handling data stored in a data buffer is relatively slow because the data is 
stored in external memory. Thus, in embodiments of the system, communication 
packets are accessed indirectly in association with a data pointer. FIG. 3 A illustrates 
a data pointer 310 associated with (i.e., pointing to) a data buffer 330 in external 
memory 240. 

15 FIFO 230 includes a number of, preferably, on-chip memory queues that are 

used to store one or more data pointers 310 that point to one or more data buffers 
330 in external memory 240. On-chip memory is an array of memory cells typically 
arranged in rows and columns and formed as a part of an integrated circuit. Each 
cell holds a value in the form of a digital bit. Information stored in on-chip memory 

20 can be accessed and processed at a faster rate in comparison to information stored in 
external memory because the CPU can avoid arbitration delays and overhead 
associated with accessing the information via a memory bus. Unfortunately, due to 
its relatively expensive cost, on-chip memory is not an economically viable medium 
for storage of large volumes of data. Referring to FIG. 3B, to increase data access 

25 bandwidth, communication packets are handled indirectly by means of buffer 

descriptors (BDs) 300. BDs 300 are transferable to external memory in accordance 
with one or more embodiments of the system. 

A BD 300 is a data structure that contains information about certain 
characteristics of a communication packet (e.g., size, type, destination) so that buffer 
30 manager 100 can handle the packet. BD 300 at least includes a data pointer 3 1 0 to 
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data buffer 330. However, BD 300 does not include data stored in the 
communication packet's payload. Payload, typically, refers to a part of a 
communication packet that holds the message data in contrast to the headers that 
provide control information about the communication packet. For example, BD 300 
5 contains information about a communication packet to adequately identify and 
describe the packet's length, destination, and memory storage location needed for 
completion of the communication. 

The system handles a variable length communication packet by associating it 
with a fixed size BD that is much smaller than the length of the entire packet. For 
2 10 example, in accordance with one aspect of the invention, BD 300 can be 64 bits (or 
y_ 8 bytes) long, where a communication packet can be up to 64K bytes in size. 

Q Handling (e.g., accessing, retrieving, moving and copying) a 64-bit BD stored in on- 

jTj chip memory is considerably faster than handling an entire communication packet 

stored in a data buffer 330 in external memory 240. 

O 

1 5 In one or more embodiments of the system, BD 300 includes information 

j % about a packet's status, length, and storage location in memory. The storage 

B location in external memory is represented by a data pointer 310. Data pointer 3 1 0 

points to a memory address in external memory 240 where data buffer 330 
containing the communication packet begins. Data pointer 3 10 in certain 
20 embodiments of the invention is a 32-bit pointer, for example. FIG. 3B illustrates an 
exemplary BD 300 that contains a 32-bit data pointer 310, in addition to the other 
fields. 

The status field is used to report the results of a Cyclical Redundancy Check 
(CRC) after the communication packet is received. CRC is an error checking 
25 technique used to ensure the accuracy of transmitted and received digital data. In 
embodiments of the invention, for example, the transmitted data in predetermined 
lengths is divided by a fixed divisor. The remainder, if any, of the calculation is 
appended onto and sent with the data. At the receiving end, the remainder value for 
the received data is recalculated based on using the same divisor. If the newly 
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calculated remainder value does not match the transmitted remainder, an error is 
detected that indicates data was corrupted during transmission. 

The length field indicates the length of the communication packet. Certain 
embodiments of the invention may include fields in addition or in exclusion to the 
fields mentioned above. For example, in some embodiments, BD 300 may include a 
5 -bit hashing field for faster table lookup searches of destination addresses. In some 
embodiments, a 2-bit type field, for example, may be included to indicate a BD 
location (e.g., first, middle, last) in the chain of BDs constructed for a packet. 
Depending on the length of the packet, there may be one or more BDs constructed 
for a packet. A destination field, for example, may indicate the destination port or 
ports to which the packet is to be transmitted during bridging or switching. An 
identifier field, for example, may be used to indicate the I/O port (e.g., Ethernet or 
USB port) from which a BD originates. Tables 1 and 2 provide examples of buffer 
descriptors for two different communication protocols in accordance with one or 
more embodiments of the invention. Other formats and fields may be included. 



TABLE 1 : Exemplary Buffer Descriptor for an ATM packet 



15 14 13 12 11 


10 9 8 7 6 5 4 


3 2 10 


Virtual circuit channel number 


Status 


Identifier | Type 


Length 


Data Pointer 


Data Pointer 


Data Pointer 


Data Pointer 



TABLE 2: Exemplary Buffer Descriptor for an Ethernet packet 



15 14 13 12 11 


10 9 8 7 6 5 4 


3 2 10 


Destination Address Hash 


Status and control | B | A 


Identifier | Type 


Source Address Hash 


Length 


Data Pointer 


Data Pointer 


Data Pointer 


Data Pointer 



Table 3 is an example of bit combinations that can be used to indicate a 
packet type, in accordance with one or more embodiments of the invention. If a 
communication packet is larger than the size of the data buffer then the information 
contained in the communication packet is stored in more than one data buffer. The 
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type information is included in the BD to indicate whether a data buffer is the first, 
middle, or last in the chain of data buffers. 



TABLE 3: Exemplary BD Type Definition 



Bit 1 


Bit 2 


Type 


0 


0 


Entire communication packet stored in single BD 


0 


1 


First in chain 


1 


0 


Middle of chain 


1 


1 


Last in chain 



Referring back to FIG. 2, in accordance with one aspect of the invention, 
FIFO 230 includes one or more of the following queues: Read Free Queue (RFQ) 
232, Receive Ready Queue (RRQ) 234, Transmit Ready Queue (TRQ) 236, and 
Write Free Queue (WFQ) 238. RFQ 232 and WFQ 238 are used to store one or 
more data pointers 310. RRQ 234 and TRQ 236 are used to store one or more data 
buffers 330. The above FIFO queues may be implemented on-chip or in external 
memory. However, in embodiments of the system for optimum performance, all 
buffer management queues in FIFO 230, with the exception of the data buffers, 
reside in on-chip memory (e.g., hardware). Table 4 provides an example of the 
depth and size that can be assigned to each queue, in accordance with one aspect of 
the invention. To meet QoS requirements, certain embodiments of the system 
include four TRQ 236s, for example, as further described below. 



TABLE 4: Size of On-Chip Memory for FIFO Queues 



Queue 


Depth 


Size in bytes 


RFQ 


128 


128x2=256 


RRQ 


128 


128x5=640 


TRQ 


32 


32x5=160 
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WFQ 



32 



32x2=64 



Read Free Queue 

RFQ 232 includes a list of free (i.e., empty) data pointers that point to data 
buffers in external memory. When a communication packet is received, I/O core 
290 interacts with I/O interface 270 to read an available data pointer from RFQ 232. 
An available data pointer is a data pointer that points to an empty storage location 
allocated in external memory called a data buffer. Some or all the information in the 
received communication packet is stored in one or more data buffers to which the 
read data pointer points. Similarly, after a communication packet is assembled by 
the CPU and is ready for transmission, the CPU interacts with CBUS interface and 
registers 260 to read an available data pointer from RFQ 232. Some or all of the 
information in the assembled communication packet is stored in a data buffer to 
which the read data pointer points, until such time that the communication packet 
can be transmitted. 

In accordance with one aspect of the invention, once information stored in a 
data buffer is transmitted, the data pointer pointing to that data buffer is said to be 
freed. At that point, I/O core 290 writes the freed data pointer to WFQ. When the 
number of freed data pointers stored in WFQ 238 reaches a maximum threshold, the 
buffer manager's control program asserts an interrupt to the CPU. The control 
program can be implemented in form of firmware or software, in accordance with 
one or more embodiments of the system. The CPU sends a DMA command through 
CBUS interface and registers 260 to transfer freed data pointers from WFQ 238 to 
external memory 240 via MBUS 210. When the number of freed data pointers 
stored in RFQ 232 reaches a minimum threshold, the buffer manager control 
program asserts an interrupt to the CPU. The CPU sends a DMA command through 
CBUS interface and registers 260 to transfer freed data pointers from external 
memory to RFQ 232 via MBUS 210. 
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Referring to FIGS. 3 A and 3B, in accordance with one aspect of the 
invention, the control program assigns 32-bit data pointers to point to data buffers 
allocated in external memory at the time of configuration. To save in design and 
manufacturing costs, however, data pointers are truncated and stored in RFQ 232 
with fewer number of bits. The truncation is performed based on the size of data 
buffers and the manner of allocation of data buffers in the external memory, as 
described below by way of example. 

To illustrate, consider an 8 Megabyte memory space that is represented by a 
32-bit memory address. In accordance with one aspect of the system, an 8- 
Megabyte memory space is partitioned into 8000 1024-byte data buffers. Since the 
memory space is 8 Megabytes, a 23 -bit data pointer is sufficiently large to address 
the memory space. In accordance with one aspect of the system, the data buffers are 
aligned on 128-byte address boundaries. As a result, the 7 least significant bits of a 
32-bit data pointer can be set to zero, for example, because they are not significant 
for memory addressing purposes. 

The most 9 significant bits of the 32-bit data pointer define the particular 8- 
Megabyte memory space and thus are common to all data pointers. In accordance 
with one embodiment of the system, the most significant 9 bits can be stored in a 
base register, allowing a 32-bit data pointer to be stored in a 16-bit data buffer. 
Other implementations and truncation schemes are possible. As such the above 
description should not be construed to limit the scope of the invention to the 
provided example. Table 5 illustrates the format of an exemplary data pointer 
address, in accordance with one or more embodiments of the invention. 

TABLE 5: Exemplary 32 bit Data Pointer Address 



9 most significant bits stored in base register 



1 6 bits stored in Queue 



7 least significant bits 



25 
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Receive Ready Queue 



RRQ 234 contains a list of ready (filled) BDs 300 that are constructed and 
associated with communication packets received by the system, as discussed earlier. 
A ready BD 300 includes a data pointer and other necessary information to reference 
a data buffer in external memory that stores one or part of a received communication 
packet. In accordance with one or more embodiments of the system, I/O core 290 
queues ready BDs 300 to RRQ 234 by interacting with I/O interface 270. The CPU 
reads ready BDs 300 from RRQ 234 by interacting with CBUS interface and 
registers 260. 

In certain embodiments of the system, BD 300 is 64 bits but is truncated to 
46 bits when it is stored in RRQ 234 for processing efficiency. This is accomplished 
by truncating the 32-bit data pointer into a 1 6-bit data pointer as described above. 
The status bits can be truncated from 4 bits to 2 bits, because only 2 bits of the status 
field are used, as it is described in further detail below. 

Transmit Ready Queue 

TRQ 236 contains a list of ready (filled) BDs 300 that have been processed 
by the CPU and are associated with communication packets that can be transmitted 
to a destination. The CPU queues ready BDs to TRQ 236 and I/O core 290 reads 
from TRQ 236. One or more embodiments of the invention include more than one 
20 (e.g., four) TRQ 236s (not shown) in order to meet QoS requirements for timely 
delivery of various types of data. By assigning priority values to multiple TRQ 
236s, various types of data (e.g., voice, high priority data, network management 
data) can be transmitted based on predetermined priorities programmed into the 
system. For example, voice packets can be routed through a first queue with the 
25 highest assigned priority, while other data can be routed through second, third, and 
forth queues with lower priorities. 

For optimal performance the size of each TRQ 236 can be configured to one 
or more fixed configurations, when the system is initially configured. FIG. 4 
illustrates an example of different configurations implemented in accordance with 
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one aspect of the invention. For example, in one configuration (e.g., bits 7 and 8 
being 00), each of the four queues are configured to be 32 BDs deep. In another 
configuration (e.g., bits 7 and 8 being 01), first and second queues are 40 BDs deep 
each, and third and forth queues are 24 BDs deep each. In the latter configuration, 
5 for example, the first two queues are used to handle higher priority traffic over the 
last two queues. As illustrated in FIG. 4 other configurations are also possible. 

It should be noted that the configurations described here and illustrated in 
FIG. 4 are provided by way of example. The system is programmable to support 
various configurations depending on the type of communication packets processed. 
1 0 Further, different transmission priority algorithms may be used. For example, in 

configuration "00" a round robin approach may be used to transmit one packet from 
each queue at a time. In configuration "01", for example, the 24-bit queues may 
have priority over the 40-bit queues as long as the 24-bit queues are full. Other 
algorithms are possible. 

1 5 The priority algorithms used for the TRQ 236s can be specific to each type 

of I/O to which buffer manager 100 is connected. For example, the Ethernet I/O 
core can read from the TRQ 236s according to one set of rules, while the CPU can 
read from the TRQs according to another set of rules. In certain cases, buffer 
manager 100 is implemented so that the priorities of each TRQ 236 are dynamically 

20 configurable. 

Write Free Queue 

WFQ 238 contains a list of free (empty) data pointers 310 returned by I/O 
core 290 and the CPU. DMA engine 280 reads from data pointers 310 from WFQ 
238 to external memory 320. In embodiments of the system, RFQ 232 and WFQ 
25 238 are combined in a 128x32 bit dual port memory, for example. A dual port 
memory allows for information to be written and read to and from memory 
simultaneously between the CPU, I/O core 290, and DMA engine 280. 

It is noteworthy that RFQ 232 and WFQ 238 may be physically implemented 
in a single on-chip memory architecture that is logically divided into two separate 
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queues. In a certain embodiment of the invention, the queues are implemented so 
that RFQ 232 can be only read from, and WFQ 238 can be only written to by the 
CPU or I/O core 290. Further, RRQ 234 and TRQ 236 may be also combined in a 
256x46 dual port memory. 

FIG. 5 illustrates a block diagram of dual port memories 510 and 520 each 
having a Port A and a Port B, in accordance with one or more embodiments of the 
system. Port A is in communication with the CPU and I/O core 290 via CBUS 
interface and registers 260 and I/O interface 290. Port B is in communication with 
external memory 240 via MBUS interface 250 (FIG. 2). CBUS interface and 
registers 260 includes one or more registers for temporarily storing data that are to 
be read from or written to FIFO queues 230. 

Using this scheme, request for accessing data buffers in external memory is 
granted immediately and the content of FIFO queues 230 is updated later, 
accordingly. A review of Appendices A and B provides a better understanding of 
the specifications of registers included in interfaces 250, 260 and 270 and the signals 
implemented, in accordance with one or more embodiments of the system. The 
registers provide an interface between the CPU, buffer manager 100, external 
memory 240, DMA engine 280, and I/O core 290. Appendices A and B are attached 
hereto and incorporated by reference in their entirety in this disclosure. 

20 System Software or Firmware 

In embodiments of the invention, a control program in form of software or 
firmware is stored in system memory and is executed by one or more CPUs to 
control the tasks of receiving, processing, and transferring communication packets to 
and from one or more I/O ports. FIGS. 6 through 9 illustrate flow diagrams of 
25 methods of handling communication packets forwarded to buffer manager 100. 

These methods can be implemented in the form of control hardware and/or control 
software, according to one or more embodiments of the system, as further described 
below. 
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Receive Method 



FIG. 6 illustrates a flow diagram of a method for receiving communication 
packets from one or more I/O ports, according to one or more embodiments of the 
system. At step 610, I/O core 290 receives a communication packet via receive line 
291 . I/O core 290 communicates with buffer manager 100's FIFO 230, via I/O 
interface 270. At step 620, I/O core 290 submits a request to buffer manager 100 to 
receive a free data pointer. A free data pointer points to a memory address in 
external memory and indicates the beginning of a data buffer (see FIG. 3 A). At step 
630, the control program determines if a free data pointer is available in RFQ 232. 
RFQ 232 is an on-chip memory that includes data pointers to free data buffers in 
external memory. In some embodiments, RFQ 232 after the system configuration 
includes 128 free data pointers, for example. If no free data pointers are available in 
RFQ 232, then the system handles an error at step 635. In embodiments of the 
system, if an error is encountered the received communication packet will be 
dropped. 

If a free data pointer is available, then at step 640 the control program 
provides I/O core 290 with a free data pointer to a data buffer in external memory. 
In certain embodiments of the system, at step 645, the control program monitors the 
availability of free data pointers in RFQ 232 after a data pointer has been read. 
Referring to FIG. 8, at step 810, the control program determines if RFQ 232 has 
reached a minimum threshold. If so, then at step 820 the control program asserts an 
interrupt to the CPU and the CPU sends a DMA command through CBUS interface 
and registers 260 to transfer freed data pointers from external memory to RFQ 232 
viaMBUS210. 

Referring back to FIG. 6, once I/O core 290 (shown in FIG. 2) receives a free 
data pointer, at step 650, I/O core 290 initiates a DMA transfer of data to the data 
buffer pointed to by the free pointer. In embodiments of the invention, I/O core 290 
includes a data queue with a configurable threshold. Once the data queue has 
reached a threshold (e.g., 64 bytes), a DMA transfer of data from the data queue to 
the data buffer in the external memory is initiated. 
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At step 660, I/O core 290 monitors the data buffer where the data is stored. 
If the control program detects that the data buffer is full or if an end of packet is 
detected, then at step 670 I/O core 290 generates a BD and includes in the BD the 
data pointer that points to the data buffer. As illustrated in FIG. 3B, in addition to 
5 the data pointer, other fields are also included in the BD for example to indicate the 
status, type, and length of information stored. 

In certain embodiments, depending on the type of I/O packet (e.g., Ethernet, 
USB, HDLC, or WAN, etc.) at step 675 the control program performs an integrity 
check on the data stored to ensure that the data received in a communication packet 
M 10 is intact. Referring to FIG. 8, at step 830, I/O core 290 performs a CRC on the data 
= to determine data integrity. If an error is detected then at step 840 the control 

^~ program handles the error by, for example, including an error code in the BD's 

SJ status field. 

3 s 

£7 After I/O core 290 has constructed a BD for the received packet, I/O core 

15 290 queues the BD in RRQ 234, at step 680. In certain embodiments of the system, 
p at step 685 the control program monitors RRQ 234's capacity threshold. Referring 

~ to FIG. 8, at step 850, the control program determines whether RRQ 234 has reached 

a maximum threshold. If so, at step 855 the control program DMA transfers one or 
more BDs from RRQ 234 to external memory. This process allows for the system to 
20 expand the storage of BDs from on-chip memory, in this case RRQ 234, to external 
memory. This expansion scheme is described in further detail below. 

Processing Method 

FIG. 7 illustrates a flow diagram of a method for processing received 
communication packets. In accordance with one aspect of the system, at step 710, 
25 I/O core 290 examines RRQ 234 to determine if it is empty. If RRQ 234 is not 

empty (i.e., includes a BD), at step 720, I/O core 290 generates an interrupt signal to 
the CPU to read one or more BDs from RRQ 234. At step 730, the CPU reads one 
or more BDs from RRQ 234. 
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In certain embodiments of the system after the CPU reads from RRQ 234, at 
step 735, the control program examines RRQ 234's storage level to determine if it is 
above or below a minimum threshold level. For optimal performance and to avoid 
delays associated with retrieving data from external memory, it is desirable to 
5 maintain a minimum number of buffer descriptors stored in each on-chip queues. 
Referring to FIG. 8, at step 860, the control program determines if RRQ 234 has 
reached the minimum threshold. If so, then at step 865 the control program causes 
one or more BDs to be DMA transferred from the external memory to RRQ 234. 

It should be noted that the DMA transfer can take place only if one or more 
o 10 BDs are transferred to the external memory as described in step 855. Transferring 

BDs from external memory to RRQ 234 when the minimum threshold is reached 
% f prevents RRQ 234 from reaching a close to empty status. If RRQ 234 is empty then 

yj the CPU will have to access BDs stored in external memory or wait for BDs to be 

j4 transferred from external memory to RRQ 234 before the CPU can process the BDs. 

s 15 In accordance with an aspect of the invention, by monitoring and maintaining a 
^ minimum threshold level, more BDs are transferred to RRQ 234 before RRQ 234 

;« reaches an empty status. As a result, the CPU can quickly access BDs stored in 

O RRQ 234 and does not have to access the external memory or wait for the BDs to be 

^ transferred from the external memory. 

20 Referring back to FIG. 7, after the CPU reads a BD from RRQ 234, the CPU 

then at step 740 processes information stored in the BD and/or information stored in 
the data buffer pointed to by the data pointer stored in the BD. In certain 
embodiments, the CPU examines the BD's status bits (i.e., the bits in the status 
field) for indication of any errors in transmission. If an error is detected then the 

25 communication packet is dropped by returning the data pointer to WFQ 238. 

Other fields in the BD designate the destination and the format in which the 
information is to be transferred. If no errors are detected, at step 750, the CPU 
analyzes the destination address to determine if the packet is to be transmitted to a 
different I/O port than it was received. If the information does not need to be 
30 retransmitted, then no action needs to be taken and at step 755 the data pointer 



23 



Attorney Docket: M-9564 1^^^ 
697678 v2 

associate with the packet is returned to WFQ 238. As described above, one or more 
TRQ 236's may be implemented, in accordance with one or more embodiments, to 
satisfy QoS requirements so that transmission of various I/O packets can be 
prioritized. 

In certain embodiments of the system, the control program at step 759 
examines WFQ 238 for maximum threshold level. Referring to FIG. 8, at step 870, 
the control program determines if WFQ 238 has reached a maximum threshold after 
the data pointer is returned to WFQ 238, at step 755. If so, then at step 875, the 
control software DMA transfers one or more data pointers from WFQ 238 to the 
external memory. 

In absence of a maximum threshold monitoring scheme, the control software 
will have to be implemented to DMA transfer each data pointer to external memory 
every time it is stored in WFQ 238. Frequent DMA transfers between on-chip and 
external memory are resource intensive. By monitoring a maximum threshold level, 
a plurality of data pointers may be transferred together rather than individually and 
via multiple DMA transfers, thereby saving valuable system resources. 

Referring back to FIG. 7, if at step 750, the control program determines that 
the received packet is to be transmitted via a different I/O port, then at step 760 the 
control program copies the BD associated with the packet to TRQ 236 (FIG. 2) 
associated with the destination I/O port. Once the BD is queued into TRQ 236, I/O 
core 290 is interrupted to poll TRQ 236. Referring to FIG. 9, as it is described in 
further detail below, at steps 950 through 960, core 290 (FIG. 2) reads a BD from 
TRQ 236 and returns the data pointer associated with the BD to WFQ 238 when the 
packet is forwarded to a network destination. 

In embodiments of the invention, each I/O port may be connected to a 
separate buffer manager. Therefore, it is noteworthy that in accordance with one or 
more aspects of the system, data packets can be handled indirectly by copying the 
BD associated with the packet from a RRQ 234 in a first buffer manager to a TRQ 
236 of a second buffer manager, as further described below. 
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Each buffer manager is programmable so that it can directly transfer a BD 
from one on-chip queue (e.g., RRQ 234) to another (e.g., TRQ 236) without having 
to copy the entire content of a data buffer from one memory location to another. 
Thus, the switching of a packet from one I/O port to another port can be 
accomplished in a highly efficient manner. 

In a network environment, the communication protocol that contains the 
physical address of a client or server station is referred to as Layer 2, the "data link 
layer," or the "Media Access Control (MAC) layer. In this layer, the destination 
address stored in a BD for a communication packet is inspected by a communication 
controller (e.g., a bridge or switch) for data routing purposes. The CPU determines 
if the communication packet being processed at step 740 is destined for the upper 
layers of the networking stack. If so, I/O core 290 sends the packet associated with 
the BD to its destination by copying the data pointer in the BD to data buffers 
maintained by the networking stack software (e.g., MBUF). 

For example, if a BD is received on a first Ethernet Port is to be forwarded to 
a second Ethernet Port, after the packet has been transmitted to the second Ethernet 
Port, I/O core 290 associated with the second Ethernet Port returns the data pointer 
to WFQ 238 associated with the first Ethernet Port. As it is further described below, 
this scheme ensures that data pointers are not lost between different buffer 
managers, so that the initial balance of data pointers assigned to each buffer manager 
at the time of initial configuration is maintained. 

Transmit Method 

FIG. 9 illustrates a method for transmitting a communication packet to a 
designated destination, in accordance with one or more aspects of the system. At 
step 910, a software application constructs a communication packet for transmission 
via a certain communication port. At step 915, the software application either 
directly or through a driver software submits a request for a free data pointer to 
buffer manger 100. At step 920, the control software determines if a free data 
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pointer is available in RFQ 232. If no free data pointers are available then the 
control software at step 922 either drops the packet or generates an error message. 

If a free data pointer is available, then at step 925 a data pointer is forwarded 
to the software application. After the data pointer is forwarded, at step 927, the 
5 control software examines RFQ 232 (FIG. 2) to determine if a minimum threshold is 
reached. If the number of data pointers stored in RFQ 232 is below the threshold 
level, then one or more data pointers are DMA transferred from the external memory 
to RFQ 232 for efficiency purposes as described earlier. Once the application 
software receives the data pointer, then at step 930, the application software copies 

10 the data in the data buffer pointed to by the data pointer, via a DMA transfer. At 
step 935, control software determines if the data buffer is full or if an end of packet 
has been received. If not, then the system reverts back to step 930 and continues to 
DMA transfer data to the data buffer until it is either full, or until the entire data for 
the communication packet is stored therein. Otherwise, at step 940 the application 

1 5 software generates a BD including the data pointer pointing to the data buffer. 

At step 945, application software queues the BD in TRQ 236 (FIG. 2) in 
accordance with priority schemes programmed into the system based on QoS 
requirements. In embodiments of the system, if TRQ 236 reaches a maximum 
threshold, then one or more BDs are DMA transferred to a queue in external 

20 memory to ensure that newly arrived BDs can be quickly stored in TRQ 236. At 

step 950, I/O core 290 reads the BD from TRQ 236. If TRQ 236's level falls below 
a minimum threshold, then one or more BDs stored in the external memory, if any, 
are DMA transferred to TRQ 236, as stated earlier. At step 955, I/O core 290 
returns the data pointer that was associated with the transmitted information to WFQ 

25 238. 

In one or more embodiments of the system, after the data pointer is returned 
to WFQ 238 the control program examines WFQ 238 to determine whether a 
maximum threshold limit has been reached. If so, as described earlier, one or more 
data pointers are DMA transferred to external memory, at step 957. I/O core 290 
30 forwards the packet to its destination at step 960. 
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Layer 2 Switching 



In a network environment, communication packets are handled at different 
stages of communication by different communication layers defined in the Open 
System Interconnection (OSI) model. The OSI model is a standard that defines 
5 seven layers for the modular management of various aspects of communication. 

Based on this model, data is assembled into communication packets and transmitted 
from one layer to the next, starting at the top layer in one communication node, 
proceeding to the bottom layer, over the communication channel to the next 
communication node and back up the hierarchy. In this process the communication 
1 0 packet may be segmented and reassemble into one or more different formats. Each 
packet includes information identifying a destination node (e.g., a destination 
address). 

The communication protocol that handles routing of a communication packet 
based on the physical address of source and destination nodes is referred to as Layer 

15 2 or the "data link layer." The data link layer is responsible for ensuring that the bits 
received at the destination node are the same as the bits sent from the source node. 
The IEEE 802 specification for LANs breaks the data link layer into two sublayers: 
the LLC (Logical Link Control) and MAC (Media Access Control). The LLC layer 
provides a common interface point to the MAC layers. The MAC layers specify the 

20 access method used. 

At Layer 2, the destination address of a communication packet is inspected 
by a communication controller (e.g., a bridge or a switch) for data routing purposes. 
In accordance with one or more embodiments of the system, during a Layer 2 
switching at an Ethernet port (e.g., MAC port), an I/O controller (e.g., MAC 

25 controller) filters out packets destined for the upper layers of the OSI model (i.e., the 
upper networking layers) by comparing the Ethernet address of the port with the 
destination address of the packet. The Ethernet address of a port is a unique number 
maintained by Institute of Electrical and Electronics Engineers (IEEE) New York, 
NY, and assigned to uniquely identify the port for data communications purposes. 

30 Packets with addresses that do not match the Ethernet address of the port are 
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forwarded at Layer 2 level to the destination address by the CPU's switching or 
bridging software. 

In accordance with one aspect of the system, the receiving port's Ethernet 
address is preprogrammed into a register so that it is readily available for 
comparison with the destination address of the packet, once the packet is received. 
The I/O controller performs the comparison after receiving a packet. If the 
preprogrammed Ethernet address does not match the destination address, then the 
I/O controller set one or more bits in the packet's status field, for example, to 
indicate that the packet can be forwarded at Layer 2 level without having to be 
routed to the upper networking layers. Table 6 is an example of a BD, in accordance 
with one embodiment of the system, wherein the A and B status bits can be 
configured to indicate that the destination address for the packet matches the 
programmed Ethernet address. 

TABLE 6: Exemplary BD 
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Destination Address 
Hash 


Status and Control 


B 
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Identifier 


Type 
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Source Address Hash 


Data Length 
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Data Pointer [23:16] 


Data Pointer [31:24] 


3 


Data Pointer [7:0] 


Data Pointer [15:8] 



The A and B bits can be configured by the I/O controller to designate the 
destination nodes and the manner in which a communication packet is to be 
transmitted. For example, "00" can indicate that the preprogrammed Ethernet 
address does not match the destination address, and that the packet is to be sent to a 
single node (e.g., unicast). Other bit configurations (e.g., "01", "10", or "11") can 
indicate that the preprogrammed address and the Ethernet address match and that the 
packet is to be transmitted to one or more nodes in the network (e.g., unicast, 
broadcast, or multicast). 

Once the CPU reads a BD from RRQ 234, (FIG. 2) the CPU examines the 
status bit to determine if the communication packet being processed at step 740 is 



28 



Attorney Docket: M-9564 
697678 v2 



destined for the upper networking layers. If so, the communication packet is copied 
to the data buffers maintained by the networking stack software and the data pointer 
associated with the data buffer where the communication packet (or part thereof) 
was stored is returned back to WFQ 238. 

5 In certain embodiments, each I/O port may be associated with a separate 

TRQ 236. If the CPU determines that the BD can be switched at Layer 2 level, then 
the CPU forwards the BD to TRQ 236 associated with the destination I/O port. 
After the communication packet has been forwarded, the data pointer associated 
with the BD for the packet will be returned by I/O core 290 to WFQ 232 of the I/O 
10 port where the BD was originally stored. As illustrated in FIG. 10, in embodiments 
% of the system, the buffer manager for an I/O port includes a separate WFQ for each 

V J I/O port so that freed data pointers for BDs received from the other I/O ports can be 

p returned to the appropriate RFQ from which the data pointer was initially read. 

* For example, as illustrated in FIG. 10, a communication packet received by 

a_ 15 I/O port B may be destined for transmission through I/O port A. In accordance with 
§2 one embodiment, a data pointer DPI is read from RFQ B and a buffer descriptor 

IS* BD1 is stored in RRQ B to handle the packet as received by buffer manager B. 

O After the packet is processed by buffer manager B, BD1 is copied to TRQ A from 

RRQ B in a Layer 2 switching scenario. Once the communication packet is 
20 transmitted via port A, then DPI is freed and should be returned to RFQ B, where it 
was originated. Referring to Tables 1 and 2 above, the identifier field is configured 
to identify the I/O port from which a packet was received, and therefore the RFQ 
associated with that I/O port. In accordance with one embodiment, a 2-bit field can 
identify up to four I/O ports (e.g., 00, 01, 10, 11). 

25 In accordance with one aspect of the system, buffer manager A includes a 

separate WFQ (e.g., WFQ A, WFQ B, WFQ C) for each I/O port in the system. 
Thus, DPI is copied to WFQ B once it is freed, since the identifier field designates 
I/O B as the originating I/O port. In some embodiments, if RFQ B level has not 
reached a maximum threshold, DPI can be directly copied to RFQ B via the CBUS. 

30 Otherwise, DPI is transferred from WFQ B to external memory first. DPI is DMA 
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transferred to RFQ B from the external memory via the MBUS when RFQ B level 
falls below the maximum threshold. As shown in FIG. 10, in certain embodiments, 
separate expansion buffers are allocated in external memory for each I/O port so that 
data pointers can be temporarily stored until they are transferred over to the RFQ 
5 from which they were originated. 

Expansion of Buffer Descriptors To External Memory 

In one or more embodiments of the system, to maximize processing speed it 
is desirable for buffer descriptors to be stored in on-chip memory rather than 
external memory. However, the on-chip memory dedicated to holding the buffer 
10 management queues (e.g., RRQ 234 and TRQ 236) must be kept to a reasonable size 
to be economically feasible. Due to this limitation, some of the buffer descriptors 
generated for processing communication packets in the buffer management system 
of this invention are transferred in external memory when the on-chip memory 
reaches a maximum threshold (i.e., reaches capacity). Thus, in certain embodiments 
O 15 of the system, on-chip buffer management queues (e.g., RRQ 234 and TRQ 236) are 

implemented so that when they reach a maximum threshold, any additionally 
Uj generated buffer descriptors are stored in external memory. 

z . 

FIG. 1 1 illustrates a block diagram of a buffer management queue, such as 
RRQ 234 and TRQ 236, having a write FIFO 1 105 and read FIFO 1 1 10, in 
20 accordance with one or more embodiments of the system. Write FIFO 1 105 is the 
section of the queue that is written to by I/O core 290 or the CPU. Read FIFO 1110 
is the section of the queue from which the CPU or the I/O reads buffer descriptors. 

In accordance with one aspect of the system, all BDs written to write FIFO 
1 105 are automatically moved to read FIFO 1 1 10 in a first mode, referred to as the 
25 short circuit mode. In the short circuit mode, the level of BDs stored in read FIFO 
1 1 10 is below a maximum threshold level. Thus, BDs can be transferred from write 
FIFO 1 105 to read FIFO 1110 until read FIFO 1110 reaches the maximum threshold 
(e.g., full). Once read FIFO 1110 reaches that threshold, then write FIFO 1 105 waits 
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for read FIFO 1 1 10 5 s level to fall below the maximum threshold before transferring 
any additional BDs to read FIFO 1110. 

In accordance with another aspect of the system, BDs written to write FIFO 
1 105 that are not immediately transferred to read FIFO 1110 and are queued up in 
FIFO 1 105 until FIFO 1 105 reaches a maximum threshold level. If by the time that 
FIFO 1 105 reaches FIFO 1 105's maximum threshold, FIFO 1 1 10's level has not 
fallen below FIFO 1 1 10's maximum threshold, then the system interrupts the CPU 
to initiate a DMA transfer from write FIFO 1 105 to an associated expansion buffer 
in external memory. This transfer mode is referred to as the long circuit mode. 

In the long circuit mode, the CPU initiates a DMA transfer of BDs stored in 
an associated expansion buffer in external memory to read FIFO 1110, when read 
FIFO 1 1 10's level falls below its maximum threshold. The DMA transfer continues 
until read FIFO 1110 reaches it maximum threshold, or until the expansion buffer in 
external memory is empty. Once the expansion buffer is empty, if read FIFO 1110 
has not reached a maximum threshold, the system switches back to the short circuit 
mode where BDs are directly transferred from write FIFO 1 105 to read FIFO 1110. 

ATM Segmentation And Reassembly (SAR) Interface 

ATM technology works by transmitting information in a communication 
stream that consists of fixed-length (e.g., 53-byte) cells. These fixed-length cells 
allow for fast segmentation and reassembly of communication packets, because it is 
much faster to process a known packet size than to detect the start and end of 
variable length packets. Due to the small size of ATM cells, voice and video can be 
inserted into an ATM communication stream at high frequency for dependable and 
real-time transmission. 

One or more embodiments of the system can be configured to provide an 
ATM Segmentation and Reassembly (SAR) I/O interface. This interface imposes 
special requirements of the buffer management system because in contrast to other 
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I/O interfaces, the ATM interface, for example, supports up to 32 different virtual 
circuits (VCs) and can include interleaved packets arriving from various VCs. A 
VC is a logical circuit within a physical communication network that provides for a 
communication path for transmission of communication packets. A communication 
packet transmitted over the data link layer (i.e., Layer 2) is sometimes referred to as 
a protocol data unit (PDU). A PDU includes one or more ATM cells that may arrive 
and end at different periods depending on the network's QoS requirements. 

Embodiments of the invention are implemented so that QoS in an ATM 
stream can be specified, allowing for voice and video to be transmitted in steady 
successive intervals, thereby providing a smooth media experience. The system 
supports CBR, rt-VBR, nrt-VBR, ABR, and UBR levels of service. Constant Bit 
Rate (CBR) guarantees bandwidth for real-time voice and video. Realtime variable 
Bit Rate (rt-VBR) supports interactive multimedia that requires minimal delays. 
Non-realtime variable bit rate (nrt-VBR) is used for bursty transaction traffic. 
Available Bit Rate (ABR) adjusts bandwidth according to congestion levels for LAN 
traffic. Unspecified Bit Rate (UBR) provides a best effort for non-critical data such 
as file transfers. 

The ATM Adaption Layer (AAL) is the part of the ATM protocol that breaks 
up application packets into 48-byte payloads, for example. The 48-byte payloads 
are assembled into ATM cells. In addition to the payload, the ATM cell also 
includes a 5-byte header, for a total of 53 bytes per cell. The AAL resides between 
the higher layer transport protocols and the ATM layer. AAL-1 is used for CBR 
service; AAL-2 for VBR; AAL-3/4 for ABR; and AAL-5 for UBR. 

In embodiments of the system, each PDU can be up to 64KB in size. 
Therefore, a single PDU (e.g., an AAL-5 frame) may need to be stored in more than 
one data buffer and therefore associated with more than one BD. In embodiments of 
the system, separate receive BD queues are allocated for each VC, so that multiple 
BDs for a single PDU can be grouped in separate and distinct queues. As illustrated 
in FIG. 12, I/O core 290 writes BDs to RRQ 234 one after another as PDUs are 
received. Since the system supports more than one VC with PDUs arriving at 
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different time intervals, there is a need for a mechanism to track each BD generated 
for each PDU cell in association with the particular VC. 

In accordance with one aspect of the system, as illustrated in Table 1 , a BD 
for an ATM packet includes a virtual circuit channel number (i.e., channel ID) and a 
5 type field. The channel numbers for different VCs are configured by the buffer 

manager driver during configuration. Upon receipt of a packet, the system detects a 
VC associated with the packet based on the virtual circuit identifier (V CI) and the 
virtual path identifier (VPI) of the VC and includes in the BD a virtual circuit 
channel number to identify the VC associated with the received packet. 

The type field, as discussed earlier, indicates whether a BD is the first, 
middle, or the last BD in a chain of BDs. Table 3 above provides an exemplary 
embodiment of bit configurations that are used in one or more embodiments of the 
invention to identify the location of a BD in a BD chain. Referring to FIG. 12, BDs 
associated with one or more VCs are stored in RRQ 234 as the PDUs are received 
by I/O core 290. 

Embodiments of the system are implemented to include one or more BD 
ready queues (BRQs) in external memory in association with each VC. Thus, for 
example, in an embodiment of the system that supports 32 VCs, 32 BRQs are 
implemented in external memory (not shown). BDs written to RRQ 234 are sorted 
by the CPU into separate BRQs based on their channel numbers, during processing. 

FIG. 1 3 illustrates a flow diagram of a method of sorting BDs into respective 
BRQs in external memory during processing and transferring communication 
packets to higher communication layers. At step 1310, the CPU waits to receive an 
interrupt signal to read a BD from RRQ 234. At step 1320, CPU reads BD from 
25 RRQ 234. At step 1330, the CPU examines the type field in the BD to determine if 
the BD is the first, middle, or last BD for a communication packet. 

If the type field for the BD indicates that the BD is associated with a single 
communication packet, then at step 1345 the CPU examines the BD's channel 
number and transfers the BD to the associated BRQ for the identified VC. If the 
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type field indicates that the BD is a first or middle BD in a chain of BDs, then at 
steps 1340 and 1350, respectively, the CPU DMA transfers the BD from RRQ 234 
to the BRQ identified by the channel number of each BD. 

If the type field for the BD indicates that the BD is the last BD in the chain 
of BDs, then at step 1360, the CPU DMA transfers the BD into the BRQ identified 
by the channel number of that BD and notifies the higher layer application that a 
packet is received. Else, at step 1355, the CPU drops the BD and returns the data 
pointer for the BD to the appropriate WRQ, as described earlier. 

In embodiments of the system, the size of data buffers can be configured at 
various capacities. For example, with a buffer size of 512 bytes approximately 10 
48-byte cells can be stored in each data buffer. Assuming an I/O data speed of 8 
Mbps, for example, the expected rate of interrupts would be approximately 2000 
(8Mbps x 1 Byte/8 Bits x 1 ISR/512 bytes) ISRs per second, in accordance with one 
or more embodiments of the system. 

As such, a system and method for managing data buffers in a communication 
system is described in conjunction with one or more embodiments. It should be 
understood, however, the embodiments disclosed here are provided by way of 
example. Other methods or system architectures and implementations may be 
utilized. These and various adaptations and combinations of features of the 
embodiments disclosed are within the scope of the invention. The invention is 
defined by the following claims and their full scope of equivalents. 
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